Managing Media Content in Multi-Display Systems

ABSTRACT

Techniques for managing media content in multi-display systems are described. For instance, the described techniques can be implemented to manage media content in the context of a client device that includes two or more housings attached via a hinge region such that the housings are pivotable relative to one another. The described techniques, for example, enable dynamic configuration of output of media content based on changes in user position relative to a client device.

BACKGROUND

Today's person is afforded a tremendous selection of devices that are capable of performing a multitude of tasks. For instance, desktop and laptop computers provide computing power and screen space for productivity and entertainment tasks. Further, smartphones and tablets provide computing power and communication capabilities in highly portable form factors. One particularly useful form factor is the foldable device, which provides users with communication and computing functionality in a highly portable package.

BRIEF DESCRIPTION OF THE DRAWINGS

Aspects of managing media content and call sessions in multi-display systems are described with reference to the following Figures. The same numbers may be used throughout to reference similar features and components that are shown in the Figures. Further, identical numbers followed by different letters reference different instances of features and components described herein:

FIG. 1 illustrates an example environment in which aspects of managing media content and call sessions in multi-display systems can be implemented.

FIG. 2 a illustrates features of the client device in accordance with one or more implementations.

FIG. 2 b illustrates further features of the client device in accordance with one or more implementations.

FIG. 3 a depicts an example system for implementing aspects of managing media content and call sessions in multi-display systems in accordance with one or more implementations.

FIG. 3 b depicts an example system for adapting output of media content based on a change in user position in accordance with one or more implementations.

FIG. 4 a depicts an example system for implementing aspects of managing media content and call sessions in multi-display systems in accordance with one or more implementations.

FIG. 4 b depicts an example system for adapting output of a call session based on a change in user position in accordance with one or more implementations.

FIG. 4 c depicts an example system for adapting a call session for different users in accordance with one or more implementations.

FIG. 5 illustrates a flow chart depicting an example method for managing media content in multi-display systems in accordance with one or more implementations.

FIG. 6 illustrates a flow chart depicting an example method for managing call sessions in multi-display systems in accordance with one or more implementations.

FIG. 7 illustrates a flow chart depicting an example method for identifying primary audio input for a call session in accordance with one or more implementations.

FIG. 8 illustrates a flow chart depicting an example method for enabling a user to be detected via a camera in accordance with one or more implementations.

FIG. 9 illustrates various components of an example device in which aspects of managing media content and call sessions in multi-display systems can be implemented.

DETAILED DESCRIPTION

Techniques for managing media content and call sessions in multi-display systems are described. For instance, the described techniques can be implemented to manage media content and call sessions in the context of a client device that includes two or more housings attached via a hinge region such that the housings are pivotable relative to one another. The client device, for example, represents a foldable device, e.g., a foldable smartphone.

For instance, consider a scenario where the client device is positioned in an orientation such that a first housing is pivoted to an open position relative to a second housing. Further, the first housing includes a first display on a front surface of the housing and a second display on a rear surface of the housing. In this orientation, for example, the second housing can be placed on a surface (e.g., a table, a desk, etc.) such that the first display and the second display on the first housing are viewable from different respective perspectives. Alternatively or additionally, the client device can be fully opened such that the device is in a planar orientation, and placed on a dock or other device holding apparatus in a vertical orientation such that the first display and the second display are viewable from different respective orientations.

Further to this scenario, media content that includes video content and audio content is to be output by the client device. To determine how to present the media content, sensor data is captured at the client device and processed to determine that a user is positioned to view the first display. The sensor data, for instance, includes an image of a user captured via a camera and/or speech audio from the user captured by an audio sensor. Accordingly, based on detecting that the user is positioned to view the first display, the media content is output via the first display and an audio device positioned for audio output at the front surface of the housing.

During output of the media content via the first display, sensor data continues to be captured and indicates that the user moves relative to the first display, e.g., that the user moves out of a field of view of a camera positioned on the front surface of the housing. Further, the sensor data indicates that the user is now positioned such that the second display on the rear surface of the housing is viewable by the user. Accordingly, output of the media content is switched to the second display and an audio device positioned for audio output at the rear surface of the housing. In at least one implementation, in response to switching media output to the second display, output of the media content on the first display is paused or stopped. Such implementations enable output of media content to adapt dynamically and automatically to changes in user position relative to a client device. Generally, various implementations are discussed herein in the context of a user changing position relative to a client device, which can occur based on user movement, repositioning of the client device, and/or combinations thereof.
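By way of illustration, the switching behavior just described might be sketched as follows in Kotlin. This is a minimal sketch, not an implementation from the text: the Surface, PresenceDetector, MediaOutput, and MediaSwitcher names are hypothetical stand-ins for the device's detection pipeline and rendering stack.

    // Hypothetical sketch: route media output to whichever display the user faces.
    enum class Surface { FRONT, REAR }

    interface PresenceDetector {
        // Returns the surface the user currently faces, or null if undetected.
        fun detectUserSurface(): Surface?
    }

    class MediaOutput(private val surface: Surface) {
        fun play() = println("Playing A/V toward the $surface display")
        fun pause() = println("Paused at the $surface display")
    }

    class MediaSwitcher(
        private val detector: PresenceDetector,
        private val outputs: Map<Surface, MediaOutput>,
    ) {
        private var active: Surface? = null

        // Poll the detector and move playback when the user changes sides.
        fun update() {
            val target = detector.detectUserSurface() ?: return
            if (target == active) return
            active?.let { outputs.getValue(it).pause() } // pause the vacated display
            outputs.getValue(target).play()              // resume toward the new position
            active = target
        }
    }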

In another scenario, consider that a client device is positioned in an open position such as described above and a call session is implemented via the client device. The call session, for instance, represents a video and audio call with a remote device. Accordingly, based on determining that a user is positioned to view the first display, visual content of the call session is output on the first display and audio content of the call session is output by an audio device positioned for audio output at the front surface of the housing. Further, for the call session, a camera positioned on the front surface of the client device captures an image of the user and a first audio sensor captures speech content from the user. The user image and the speech content, for instance, are transmitted to a remote device participating in the call session.

Further to this scenario, sensor data being captured at the client device indicates that the user is repositioned such that the second display device is now viewable by the user. Accordingly, output of the call session is switched to the second display device and an audio device positioned for audio output at the rear surface of the housing. Further, for the call session, a camera positioned on the rear surface of the client device captures an image of the user and a second audio sensor captures speech content from the user. In at least one implementation, based on switching output of the call session to the second display device, output of the call session via the first display device is paused or stopped. According to various implementations, this enables output and input for a call session to adapt to changes in user position relative to a client device.

In another example implementation, consider that a client device is positioned in an open position such as described above and a call session is implemented that includes multiple users present in proximity to the client device. For instance, a first user is positioned to view the first display at the front surface and a second user is positioned to view the second display at the rear surface. In such scenarios the described techniques are able to adapt to changes in user input from the different users. For instance, when the first user is providing speech input for the call session, input focus is placed on the first user and content of the call session is output via the first display and an audio device at the front surface of the client device. The speech input from the first user, for example, is emphasized in audio data of the call session, such as by filtering out audio (e.g., noise) that does not originate from the first user. When the second user begins speaking, input focus is placed on the second user, such as by emphasizing speech input and/or video content from the second user as part of the call session. For instance, based on detecting the second user via speech input and/or image detection, content of the call session is output via the second display and an audio device at the rear surface of the client device. The call session, for example, is output at both the front surface and the rear surface of the client device. According to various implementations, this enables input and output for a call session to dynamically adapt to changes in user presence and user interaction with a call session in multi-user scenarios.

While features and concepts of managing media content and call sessions in multi-display systems can be implemented in any number of environments and/or configurations, aspects of the described techniques are described in the context of the following example systems, devices, and methods. Further, the systems, devices, and methods described herein are interchangeable in various ways to provide for a wide variety of implementations and operational scenarios.

FIG. 1 illustrates an example environment 100 in which aspects of managing media content and call sessions in multi-display systems can be implemented. The environment 100 includes a client device 102, a communication service 104, and a content service 106 that are interconnectable via network(s) 108. In this particular example, the client device 102 represents a mobile foldable device, such as a foldable smartphone, a foldable tablet device, a foldable laptop device, and so forth. Example attributes of the client device 102 are discussed below with reference to the device 900 of FIG. 9.

The client device 102 includes various functionality that enables the client device 102 to perform different aspects of managing media content and call sessions in multi-display systems discussed herein, including a mobile connectivity module 110, sensors 112, display devices 114, audio devices 116, applications 118, a recognition module 120, and a presenter module 122. The mobile connectivity module 110 represents functionality (e.g., logic and hardware) for enabling the client device 102 to interconnect with other devices and/or networks, such as the network 108. The mobile connectivity module 110, for instance, enables wireless and/or wired connectivity of the client device 102.

The sensors 112 are representative of functionality to detect various physical and/or logical phenomena in relation to the client device 102, such as motion, light, image detection and recognition, time and date, position, location, touch detection, sound, temperature, and so forth. Examples of the sensors 112 include hardware and/or logical sensors such as an accelerometer, a gyroscope, a camera, a microphone, a clock, biometric sensors, touch input sensors, position sensors, environmental sensors (e.g., for temperature, pressure, humidity, and so on), geographical location information sensors (e.g., Global Positioning System (GPS) functionality), and so forth. In this particular example the sensors 112 include cameras 124, audio sensors 126, and an orientation sensor 128. The sensors 112, however, can include a variety of other sensor types in accordance with the implementations discussed herein.
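As one concrete realization of an orientation sensor on a foldable device, the Android platform exposes a hinge-angle sensor (Sensor.TYPE_HINGE_ANGLE, available since API level 30) that reports the angle between the housings in degrees. The sketch below assumes an Android context; the text itself does not mandate any particular platform, and the HingeAngleMonitor name is illustrative.

    import android.content.Context
    import android.hardware.Sensor
    import android.hardware.SensorEvent
    import android.hardware.SensorEventListener
    import android.hardware.SensorManager

    class HingeAngleMonitor(context: Context, private val onAngle: (Float) -> Unit) :
        SensorEventListener {

        private val sensorManager =
            context.getSystemService(Context.SENSOR_SERVICE) as SensorManager
        private val hingeSensor: Sensor? =
            sensorManager.getDefaultSensor(Sensor.TYPE_HINGE_ANGLE)

        fun start() {
            hingeSensor?.let {
                sensorManager.registerListener(this, it, SensorManager.SENSOR_DELAY_NORMAL)
            }
        }

        fun stop() = sensorManager.unregisterListener(this)

        override fun onSensorChanged(event: SensorEvent?) {
            // values[0] is the hinge angle in degrees: 0 closed, ~180 fully open.
            event?.values?.firstOrNull()?.let(onAngle)
        }

        override fun onAccuracyChanged(sensor: Sensor?, accuracy: Int) = Unit
    }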

The display devices 114 represent functionality for outputting visual content via the client device 102. As further detailed below, for instance, the client device 102 includes multiple display devices 114 that can be leveraged for outputting content. The audio devices 116 represent functionality for providing audio output for the client device 102. In at least one implementation the client device 102 includes audio devices 116 positioned at different regions of the client device 102, such as to provide for different audio output scenarios. The applications 118 represent functionality for performing different computing tasks via the client device 102, such as gaming, media consumption (e.g., content streaming), productivity tasks (e.g., word processing, content generation, data analysis, etc.), content generation, web browsing, communication with other devices, and so forth. In this particular example the applications 118 include a communication application 118 a which represents functionality for enabling the client device 102 to engage in communication with other devices. The communication application 118 a, for instance, represents a portal for interfacing with the communication service 104, such as for enabling communication (e.g., call sessions) between users of different devices.

The recognition module 120 represents functionality for recognizing objects detected by the sensors 112. For instance, utilizing video data captured by the cameras 124, the recognition module 120 can recognize visual objects present in the video data, such as a person. Various other types of sensor data may additionally or alternatively be used, such as audio data captured by the audio sensors 126. The presenter module 122 represents functionality for performing various aspects pertaining to managing media content and call sessions in multi-display systems in accordance with various implementations. For instance, and as further detailed below, the presenter module 122 is operable to configure and/or adapt presentation of media content and call sessions by the client device 102, such as based on user position relative to the client device 102.

FIG. 2 a illustrates features of the client device 102 in accordance with one or more implementations. The view depicted in FIG. 2 a, for instance, represents a front-facing view of the client device 102. The client device 102 includes a first housing 202 attached to a second housing 204 via a hinge region 206. The first housing 202 and/or the second housing 204, for instance, are pivotable about the hinge region 206 to assume a variety of different angular orientations relative to one another. The first housing 202 includes a display device 114 a positioned on a front surface 208 of the first housing 202, and the second housing 204 includes a display device 114 b positioned on a front surface 210 of the second housing 204. The client device 102 further includes a camera 124 a positioned on the front surface 208 of the first housing 202. The camera 124 a is positionable in various ways, such as within the perimeter of the display device 114 a and/or underneath the display device 114 a. Alternatively or additionally, the camera 124 a is positionable adjacent the display device 114 a.

In the depicted orientation the client device 102 is in a partially open position 212 with the first housing 202 pivoted away from the second housing 204. The first housing 202 is further pivotable about the hinge region 206 away from the second housing 204 to a fully open position 214. In the fully open position 214, for instance, the first housing 202 is substantially coplanar with the second housing 204. For example, in the fully open position 214 the display device 114 a and the display device 114 b are coplanar and form a single integrated display surface. The first housing 202 is also pivotable about the hinge region 206 to a closed position 216 where the display device 114 a is positioned against the display device 114 b. In at least one implementation the orientation sensor 128 is able to detect an orientation of the client device 102, e.g., based on an orientation of the first housing 202 relative to the second housing 204. The orientation sensor 128, for instance, can detect an angle of the first housing 202 relative to the second housing 204, and/or an amount of pivoting motion and/or rotation of the hinge region 206. Detecting the orientation of the client device 102 can be utilized for various purposes, such as for determining how to present media content and/or a call session on the client device 102.
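A plausible mapping from a measured hinge angle to the positions named above might look like the following sketch. The degree thresholds are illustrative assumptions, not values given in the text.

    // Classify device posture from the hinge angle reported by the orientation sensor.
    enum class Posture { CLOSED, PARTIALLY_OPEN, FULLY_OPEN }

    fun classifyPosture(hingeAngleDegrees: Float): Posture = when {
        hingeAngleDegrees < 10f  -> Posture.CLOSED          // displays face to face (position 216)
        hingeAngleDegrees < 170f -> Posture.PARTIALLY_OPEN  // stand-style usage (position 212)
        else                     -> Posture.FULLY_OPEN      // housings roughly coplanar (position 214)
    }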

Generally, the partially open position 212 enables different usage scenarios. For instance, in the partially open position 212 the second housing 204 can be placed on a surface such as a desk or a table to enable media content to be displayed on the display device 114 a for user consumption. The client device 102 further includes audio devices 116 a that provide audio output, such as part of output of media content by the client device 102, and an audio sensor 126 a that is operable to detect audible input, such as speech input from a user of the client device 102. In at least one implementation the audio devices 116 a are configured to output audio content correlated with video content output by the display device 114 a.

FIG. 2 b illustrates further features of the client device 102 in accordance with one or more implementations. The view depicted in FIG. 2 b, for instance, represents a rear-facing view of the client device 102, such as in the partially open position 212. In this view a rear surface 218 of the first housing 202 is illustrated, and the rear surface 218 includes a display device 114 c. Further, the rear surface 218 includes a camera 124 b positioned on the rear surface 218 of the first housing 202. The camera 124 b is positionable in various ways, such as within the perimeter of the display device 114 c and/or underneath the display device 114 c. Alternatively or additionally, the camera 124 b is positionable adjacent the display device 114 c. The client device 102 further includes audio devices 116 b and an audio sensor 126 b. The audio devices 116 b, for instance, provide audio output, such as part of output of media content by the client device 102 via the display device 114 c. The audio sensor 126 b is operable to detect audible input, such as speech input from a user of the client device 102. In at least one implementation the audio devices 116 b are configured to output audio content correlated with video content output by the display device 114 c.

Having discussed an example environment and device in which the disclosed techniques can be performed, consider now some example scenarios and implementation details for implementing the disclosed techniques.

FIG. 3 a depicts an example system 300 a for implementing aspects of managing media content and call sessions in multi-display systems in accordance with one or more implementations. Generally, the system 300 a can be implemented in the environment 100 and incorporates features and attributes of the environment 100 introduced above.

In the system 300 a a user 302 is viewing media content 304 displayed on the display device 114 a of the client device 102. Further, the sensors 112 capture various state information pertaining to the client device 102 and the user 302. For instance, the orientation sensor 128 determines a device orientation 306 of the client device 102, such as based on orientation and/or angle of the first housing 202 relative to the second housing 204. In this particular example the device orientation 306 indicates that the client device 102 is in the partially open position 212 described previously. In at least some implementations functionality of the client device 102 is controllable based on different device orientations. For instance, based on detecting that the client device 102 is in the partially open position 212, the presenter module 122 can perform dynamic switching of media content output, such as further described below.

Further to the system 300 a the camera 124 a captures image data 308 a and the audio sensor 126 a captures audio data 310 a. The image data 308 a, for instance, includes data describing visual objects in view of the camera 124 a. Further, the audio data 310 a includes data that describes detected audio, such as speech audio from the user 302.

Accordingly, the recognition module 120 receives the image data 308 a and the audio data 310 a and determines a user state 314 a based on the received data. The recognition module 120, for instance, determines that the user 302 is in a position to view the media content 304 being presented on the display device 114 a and/or is currently viewing the media content 304. The recognition module 120, for instance, implements facial recognition and/or gaze detection to determine that the user 302 is viewing the display device 114 a. Alternatively or additionally, the recognition module 120 determines based on the audio data 310 a that the user 302 is positioned such that the display device 114 a is viewable. Thus, the user state 314 a indicates that the user 302 is positioned to view the display device 114 a.

Further to the system 300 a, the presenter module 122 determines a content state 316 that indicates that the media content 304 is being output on the display device 114 a. Further, the presenter module 122 receives the user state 314 a and the device orientation 306. Thus, based on the content state 316, the user state 314 a, and the device orientation 306, the presenter module 122 determines that the user 302 is positioned such that the media content 304 is viewable by the user on the display device 114 a. The presenter module 122, for instance, determines that the user 302 is within a field of view 318 of the camera 124 a such that the display device 114 a is viewable by the user 302.

Further, as part of outputting the media content 304, visual content 320 is output via the display device 114 a and audio content 322 is output via the audio devices 116 a. In at least one implementation the audio devices 116 a are positioned to direct the audio content 322 into a region extending from the front surface 208 of the first housing 202, such as to direct the audio content 322 toward a user that is viewing the display device 114 a.

While the system 300 a is discussed in the context of using the image data 308 a and the audio data 310 a to detect user position relative to the client device 102, at least some implementations can use a subset of this data. For instance, to determine the user state 314 a the recognition module 120 can utilize the image data 308 a alone, the audio data 310 a alone, and/or a combination of any suitable type of sensor data.

According to various implementations the recognition module 120 receives image data 308 a and/or audio data 310 a in real time and dynamically updates the user state 314 a. For instance, when the position of the user 302 changes relative to the client device 102 (e.g., based on movement of the user 302 and/or the client device 102), the recognition module 120 updates the user state 314 a to identify the change in position of the user 302. When the user state 314 a indicates that a position of the user 302 changes a threshold amount (e.g., to outside of the field of view 318) the presenter module 122 can perform various actions. For instance, based on determining that the position of the user 302 changes a threshold amount, the presenter module 122 performs an action to change output of the media content 304 to adapt to the change in position of the user 302.
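The threshold test described here might be sketched as follows: a reposition fires only after the user has been undetected for a sustained interval, which avoids flapping on momentary detection dropouts. The grace window and the boolean detection input are illustrative assumptions, not details from the text.

    // Debounced position-change detection over a stream of per-frame samples.
    class PositionChangeDetector(private val graceMillis: Long = 500) {
        private var lastSeenInView = System.currentTimeMillis()

        // Feed one detection sample; returns true when a reposition should fire.
        fun onSample(
            userInFieldOfView: Boolean,
            now: Long = System.currentTimeMillis(),
        ): Boolean {
            if (userInFieldOfView) {
                lastSeenInView = now
                return false
            }
            // Fire only once the user has been undetected for the full grace window.
            return now - lastSeenInView >= graceMillis
        }
    }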

FIG. 3 b depicts an example system 300 b for adapting output of media content based on a change in user position in accordance with one or more implementations. Generally, the system 300 b can be implemented in the environment 100 and incorporates features and attributes of the environment 100 introduced above. The system 300 b, for instance, represents a continuation of the system 300 a.

In the system 300 b the user 302 moves to a different position relative to the client device 102 and/or repositions the client device 102. The user 302, for instance, moves out of the field of view 318 of the camera 124 a and moves into a field of view 324 of the camera 124 b. In at least one implementation, in response to detecting that the user 302 moves out of the field of view 318, the presenter module 122 activates the camera 124 b to determine if the user 302 is detected in the field of view 324.

Accordingly, with the user 302 repositioned relative to the client device 102, the sensors 112 continue capturing state information pertaining to the user 302 and the client device 102. For instance, the orientation sensor 128 determines the device orientation 306, e.g., that the client device 102 remains in the partially open position 212. Further, the camera 124 b captures image data 308 b and the audio sensor 126 b captures audio data 310 b. The recognition module 120 receives the image data 308 b and the audio data 310 b and determines a user state 314 b based on the received data. The recognition module 120, for instance, determines that the user 302 is positioned such that the display device 114 c is viewable, e.g., is within the field of view 324 of the camera 124 b. As referenced above, the recognition module 120 can utilize a particular instance and/or combination of sensor data to determine user position, such as the image data 308 b, the audio data 310 b, and/or any suitable combination of sensor data. Accordingly, the user state 314 b indicates that the user 302 is positioned such that the display device 114 c is viewable.

The presenter module 122 receives the user state 314 b data and determines that the user 302 has moved and is now positioned such that the display device 114 c is viewable. The presenter module 122 implements a content reposition 326 operation to move presentation of the media content 304 to the display device 114 c of the client device 102. For instance, output of the visual content 320 is moved to the display device 114 c, and output of the audio content 322 is moved to the audio devices 116 b. In at least one implementation, in conjunction with the content reposition 326, output of the media content 304 on the display device 114 a and the audio content 322 on the audio devices 116 a is stopped.
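A content reposition of this kind can be sketched as a handoff that carries the playback position over so the switch is seamless for the user. The Player interface below is a hypothetical stand-in for a per-display audio/video sink; the text does not prescribe this API.

    // Hypothetical per-display A/V sink.
    interface Player {
        var positionMillis: Long
        fun play()
        fun stop()
    }

    fun repositionContent(from: Player, to: Player) {
        val resumeAt = from.positionMillis // capture where the user left off
        from.stop()                        // stop output at the vacated display
        to.positionMillis = resumeAt       // carry the playback position over
        to.play()                          // resume toward the user's new position
    }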

The implementations discussed in the systems 300 a, 300 b are dynamically operable to respond to changes in user position and to reposition content output accordingly. For instance, if the user 302 subsequently moves again and is positioned within the field of view 318 to view the display device 114 a as depicted in the system 300 a, the presenter module 122 can perform a further content reposition 326 to reposition output of the media content 304 onto the display device 114 a and the audio devices 116 a.

FIG. 4 a depicts an example system 400 a for implementing aspects of managing media content and call sessions in multi-display systems in accordance with one or more implementations. The system 400 a can be implemented in the environment 100 and incorporates features and attributes of the environment 100 introduced above.

In the system 400 a the user 302 is participating in a call session 402 via the client device 102. The call session 402, for instance, represents a video call implemented via the communication application 118 a and the communication service 104. The call session 402 includes visual content 404 output via the display device 114 a and audio content 406 output via an audio device 116 a. The visual content 404 and the audio content 406, for instance, include visual and audio content received from a remote device and represent a remote user of the remote device. Further, the visual content 404 and the audio content 406 include visual and audio content captured at the client device 102 and transmitted to the remote device.

In the system 400 a the sensors 112 capture various state information pertaining to the client device 102 and the user 302. For instance, the orientation sensor 128 determines a device orientation 408 of the client device 102, such as based on orientation and/or angle of the first housing 202 relative to the second housing 204. In this particular example the device orientation 408 indicates that the client device 102 is in the partially open position 212 described previously.

Further, the camera 124 a captures image data 410 a and the audio sensor 126 a captures audio data 412 a. The image data 410 a, for instance, includes data describing visual objects in view of the camera 124 a. The audio data 412 a includes data that describes detected audio, such as speech audio from the user 302. The image data 410 a and the audio data 412 a are included in the visual content 404 and the audio content 406, respectively, such as for transmission to a remote device participating in the call session 402. Accordingly, the recognition module 120 receives the image data 410 a and the audio data 412 a and determines a user state 414 a based on the received data. The recognition module 120, for instance, determines that the user 302 is in a position to view the call session 402 being presented on the display device 114 a and/or is currently viewing the call session 402. In at least one implementation the recognition module 120 implements facial recognition and/or gaze detection to determine that the user 302 is viewing the display device 114 a. Alternatively or additionally, the recognition module 120 determines based on the audio data 412 a that the user 302 is positioned such that the display device 114 a is viewable. Thus, the user state 414 a indicates that the user 302 is positioned to view the display device 114 a.

Further to the system 400 a, the presenter module 122 determines a call state 416 that indicates that the call session 402 is being output on the display device 114 a and the audio device 116 a. The presenter module 122 receives the user state 414 a and the device orientation 408. Based on the call state 416, the user state 414 a, and the device orientation 408, the presenter module 122 determines that the user 302 is positioned such that the call session 402 is viewable by the user on the display device 114 a. The presenter module 122, for instance, determines that the user 302 is within the field of view 318 of the camera 124 a such that the display device 114 a is viewable by the user 302.

According to various implementations the recognition module 120 receives image data 410 a and/or audio data 412 a in real time and dynamically updates the user state 414 a. For instance, when the position of the user 302 changes relative to the client device 102 (e.g., based on movement of the user 302 and/or the client device 102), the recognition module 120 updates the user state 414 a to identify the change in position of the user 302. When the user state 414 a indicates that a position of the user 302 changes a threshold amount (e.g., to outside of the field of view 318) the presenter module 122 can perform various actions. For instance, based on determining that the position of the user 302 changes a threshold amount, the presenter module 122 performs an action to change output of the call session 402 to adapt to the change in position of the user 302.

FIG. 4 b depicts an example system 400 b for adapting output of a call session based on a change in user position in accordance with one or more implementations. The system 400 b can be implemented in the environment 100 and incorporates features and attributes of the environment 100 introduced above. The system 400 b, for instance, represents a continuation of the system 400 a.

In the system 400 b the user 302 moves to a different position relative to the client device 102 and/or repositions the client device 102 while the call session 402 is in progress. The user 302, for instance, moves out of the field of view 318 of the camera 124 a and moves into a field of view 324 of the camera 124 b. In at least one implementation, in response to detecting that the user 302 moves out of the field of view 318, the presenter module 122 activates the camera 124 b to determine if the user 302 is detected in the field of view 324.

Accordingly, with the user 302 repositioned relative to the client device 102, the sensors 112 continue capturing state information pertaining to the user 302 and the client device 102. For instance, the orientation sensor 128 determines the device orientation 408, e.g., that the client device 102 remains in the partially open position 212. Further, the camera 124 b captures image data 410 b and the audio sensor 126 b captures audio data 412 b. The recognition module 120 receives the image data 410 b and the audio data 412 b and determines a user state 414 b based on the received data. The recognition module 120, for instance, determines that the user 302 is positioned such that the display device 114 c is viewable, e.g., is within the field of view 324 of the camera 124 b. As referenced above, the recognition module 120 can utilize a particular instance and/or combination of sensor data to determine user position, such as the image data 410 b, the audio data 412 b, and/or any suitable combination of sensor data. Accordingly, the user state 414 b indicates that the user 302 is positioned such that the display device 114 c is viewable.

The presenter module 122 receives the user state 414 b data and determines that the user 302 has moved and is now positioned such that the display device 114 c is viewable. Accordingly, based on the call state 416 indicating that the call session 402 is in progress and the user state 414 b, the presenter module 122 implements a call reposition 418 operation to move presentation of the call session 402 to the display device 114 c of the client device 102. For instance, output of the visual content 404 is moved to the display device 114 c, and output of the audio content 406 is moved to the audio devices 116 b. In at least one implementation, in conjunction with the call reposition 418, output of the visual content 404 on the display device 114 a and the audio content 406 on the audio devices 116 a is stopped.

The implementations discussed in the systems 400 a, 400 b are dynamically operable to respond to changes in user position and to reposition content output accordingly. For instance, if the user 302 subsequently moves again and is positioned within the field of view 318 to view the display device 114 a as depicted in the system 400 a, the presenter module 122 can perform a further call reposition 418 to reposition output of the visual content 404 onto the display device 114 a and the audio content 406 onto the audio devices 116 a.

FIG. 4 c depicts an example system 400 c for adapting a call session for different users in accordance with one or more implementations. The system 400 c can be implemented in the environment 100 and incorporates features and attributes of the environment 100 introduced above. The system 400 c, for instance, represents a continuation of the systems 400 a, 400 b.

In the system 400 c the call session 402 is in progress on the client device 102. Further, the user 302 is positioned within the field of view 318 of the camera 124 a and a user 420 is positioned within the field of view 324 of the camera 124 b. The cameras 124 a, 124 b capture image data 410 c and the audio sensors 126 a, 126 b capture audio data 412 c. The recognition module 120 receives the image data 410 c and the audio data 412 c and generates user state 414 c data. The user state 414 c indicates that the user 302 is detected as being positioned to view the display device 114 a (e.g., within the field of view 318) and the user 420 is positioned to view the display device 114 c, e.g., within the field of view 324. Thus, based on the user state 414 c indicating that the users 302, 420 are positioned to view the respective display devices 114 a, 114 c, the presenter module 122 causes the call session 402 to be output via both of the display devices 114 a, 114 c and the audio devices 116 a, 116 b. The user state 414 c also indicates that the user 420 is interacting with the call session 402. The audio data 412 c, for instance, indicates voice input from the user 420. Alternatively or additionally, the image data 410 c indicates that the user 420 is talking, e.g., based on mouth movement detected from the user 420.

Accordingly, based on the user state 414 c indicating audio input from the user 420, the presenter module 122 performs a focus reposition 422 that focuses on voice input from the user 420. For instance, the user state 414 c indicates that speech input to the call session 402 switches from the user 302 to the user 420. Since the location of the user 420 is known based on the image data 410 c and the audio data 412 c, the presenter module 122 causes the audio content 406 to focus on speech input received from the user 420. In at least one implementation the presenter module 122 implements and/or causes to be implemented audio beamforming that focuses on speech input from the user 420 detected in the audio data 412 c for inclusion in the audio content 406. Further, the presenter module 122 can apply a visual enhancement to an image of the user 420 obtained from the image data 410 c for inclusion in the visual content 404, such as to provide a visual indication that the user 420 is speaking.
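Audio beamforming admits many implementations; one minimal (assumed) variant is a two-microphone delay-and-sum beamformer, sketched below with integer-sample delays. Production systems would typically use fractional delays and adaptive weighting, so this illustrates only the principle of reinforcing sound from a known direction.

    import kotlin.math.roundToInt
    import kotlin.math.sin

    // Two-microphone delay-and-sum beamformer steered toward angleRadians
    // (measured from broadside to the microphone pair).
    fun delayAndSum(
        mic1: FloatArray,
        mic2: FloatArray,
        angleRadians: Double,
        micSpacingMeters: Double,
        sampleRateHz: Double,
        speedOfSoundMps: Double = 343.0,
    ): FloatArray {
        // Extra propagation time to the second microphone for the chosen direction.
        val delaySeconds = micSpacingMeters * sin(angleRadians) / speedOfSoundMps
        val delaySamples = (delaySeconds * sampleRateHz).roundToInt()
        val n = minOf(mic1.size, mic2.size)
        val out = FloatArray(n)
        for (i in 0 until n) {
            val j = i + delaySamples // shift mic2 so both channels align for the target
            val aligned = if (j in 0 until n) mic2[j] else 0f
            out[i] = 0.5f * (mic1[i] + aligned) // coherent sum boosts the target direction
        }
        return out
    }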

The system 400 c is implementable to perform multiple focus repositions 422 to dynamically adapt to changes in user interaction with the call session 402. For instance, as the audio data 412 c indicates that speech input switches between the users 302, 420, the user state 414 c indicates changes in which user 302, 420 is currently speaking, and focus repositions 422 are thus performed to focus the audio content 406 on whichever of the users 302, 420 is speaking.

While implementations are discussed herein with reference to the client device 102 in the partially open position 212, it is to be appreciated that the described techniques can be applied with the client device 102 in a variety of different orientations. For instance, the client device can be opened to the fully open position 214 such that the device is in a planar or semi-planar orientation, and placed on a dock or other device holding apparatus in a vertical orientation such that the display device 114 a and the display device 114 c are viewable from different respective orientations. In this orientation the techniques described herein can be applied, such as for managing media content and/or call sessions.

FIG. 5 illustrates a flow chart depicting an example method 500 for managing media content in multi-display systems in accordance with one or more implementations. At 502 it is determined that media content is to be output via a client device. The presenter module 122, for instance, determines that a user performs an action to indicate that media content is to be output via the client device 102. The media content, for instance, includes digital visual content and, optionally, digital audio content.

At 504 it is detected that a first housing of the client device is positioned in an open position relative to a second housing of the client device. For example, the presenter module 122 determines that the client device 102 is positioned in the partially open position 212. Generally, orientation of the client device 102 can be determined in various ways, such as based on an angle of the first housing 202 relative to the second housing 204, a rotational orientation of the hinge region 206, a spatial position of the first housing 202 and/or the second housing 204, and/or combinations thereof.

At 506 a position of a user relative to a first display device of the client device is monitored. The recognition module 120, for example, receives image data from a camera 124 and/or audio data from an audio sensor 126 and determines a user position relative to a display device 114 of the client device 102, such as the display device 114 a and/or the display device 114 c. At 508 media content is caused to be output via the first display device based on detecting that the user is positioned to view the first display device. The presenter module 122, for example, determines that a user is positioned to view the display device 114 a, such as based on user state data received from the recognition module 120.

At 510 sensor data is received indicating that a position of the user changes such that the user is positioned to view a second display device of the client device. The recognition module 120, for example, receives sensor data from the sensors 112 indicating that the user is repositioned relative to the client device 102, such as based on image data received from a camera 124 and/or audio data received from an audio sensor 126. In at least one implementation the recognition module 120 generates updated user state data that indicates the change in user position.

At 512, based on the change in the position of the user, the media content is caused to be output on the second display device. The presenter module 122, for example, receives an indication of the change in user position such that the second display device is viewable by the user (e.g., based on updated user state data) and causes the media content to be output via the second display device. Further, where the media content includes audio content, output of the audio content is switchable to an audio device 116 that is positioned to output audio content related to visual content output by the second display device. In at least one implementation, based on outputting the media content via the second display device, output of the media content via the first display device is paused and/or stopped.

The method 500 is performable in real time to respond dynamically to changes in user position over time. For instance, if the user is repositioned again relative to the client device 102 such that the first display device is detected as being viewable by the user, output of the media content is switchable to the first display device and, optionally, output of the media content via the second display device is paused and/or stopped.

FIG. 6 illustrates a flow chart depicting an example method 600 for managing call sessions in multi-display systems in accordance with one or more implementations. At 602 it is determined that a call session is to be output via a client device. The presenter module 122, for instance, determines that a user performs an action to indicate that a call session is to be output via the client device 102. The user, for example, invokes the communication application 118 a to initiate a call session and/or to join a call session.

At 604 it is detected that a first housing of the client device is positioned in an open position relative to a second housing of the client device. For example, the presenter module 122 determines that the client device 102 is positioned in the partially open position 212. Generally, orientation of the client device 102 can be determined in various ways, examples of which are discussed above.

At 606 a position of a user relative to a first display device of the client device is monitored. The recognition module 120, for example, receives image data from a camera 124 and/or audio data from an audio sensor 126 and determines a user position relative to a display device 114 of the client device 102, such as the display device 114 a and/or the display device 114 c. At 608 the call session is caused to be output via the first display device based on detecting that the user is positioned to view the first display device. The presenter module 122, for example, determines that a user is positioned to view the display device 114 a, such as based on user state data received from the recognition module 120.

At 610 sensor data is received indicating that a position of the user changes such that the user is positioned to view a second display device of the client device. The recognition module 120, for example, receives sensor data from the sensors 112 while the call session is active indicating that the user is repositioned relative to the client device 102, such as based on image data received from a camera 124 and/or audio data received from an audio sensor 126. In at least one implementation the recognition module 120 generates updated user state data that indicates the change in user position.

At 612, based on the change in the position of the user, the call session is caused to be output on the second display device. The presenter module 122, for example, receives an indication of the change in user position such that the second display device is viewable by the user (e.g., based on updated user state data) and causes the call session to be output via the second display device. In at least one implementation, output of audio content of the call session is switchable to an audio device 116 that is positioned to output audio content related to visual content output by the second display device. In at least one implementation, based on outputting the call session via the second display device, output of the call session via the first display device is paused and/or stopped.

The method 600 is performable in real time to respond dynamically to changes in user position over time. For instance, if the user is repositioned again relative to the client device 102 such that the first display device is detected as being viewable by the user, output of the call session is switchable to the first display device and, optionally, output of the call session via the second display device is paused and/or stopped.

FIG. 7 illustrates a flow chart depicting an example method 700 for identifying primary audio input for a call session in accordance with one or more implementations. In at least one implementation the method 700 is performable in conjunction with the method 600, such as dynamically during a call session. At 702 audio input from a first user for a call session is caused to be captured via a first audio sensor of a client device based on detecting speech audio at the first audio sensor. For instance, the recognition module 120 receives audio data indicating that speech audio is detected as being directed toward the audio sensor 126 a at the front surface 208 of the first housing 202 of the client device 102. In at least one implementation the recognition module 120 utilizes audio beamforming to identify a direction of the speech audio relative to the front surface 208. The presenter module 122 utilizes the audio input to generate audio content for the call session, such as for transmission to a remote device participating in the call session. In at least one implementation the detected speech audio originates from a first user that is participating in the call session.

Further, video data is captured of the first user that generates the speech audio and is utilized to generate video content for the call session, such as for transmission to the remote device. The camera 124 a at the front surface 208, for example, captures an image of the user and generates video content that includes the image of the user, such as a live image of the user while the user is speaking. In conjunction with capturing the video and audio content of the first user, video and audio content of the call session is output for the first user at a first display device (e.g., the display device 114 a) and a first audio device, e.g., the audio device 116 a.

At 704 it is determined that further speech audio from a second user is detected at a second audio sensor of the client device. For instance, the recognition module 120 receives audio data indicating that the speech audio is detected as being directed toward the audio sensor 126 b at the rear surface 218 of the first housing 202 of the client device 102. In at least one implementation the recognition module 120 utilizes audio beamforming to identify a direction of the speech audio relative to the rear surface 218. Alternatively or additionally, video data from a camera 124 (e.g., the camera 124 b) is captured that identifies the presence of a user that is positioned at the rear surface 218 of the first housing of the client device 102. The user, for instance, represents a second user that is present in proximity to the client device 102 concurrently with the first user.

At 706 further audio input for the call session from the second user is captured via the second audio sensor based on detecting the further speech audio at the second audio sensor. The presenter module 122, for instance, utilizes the further audio input to generate audio content for the communication session. At 708 the further audio input from the second user is configured as active audio for the call session based on detecting the further speech audio at the second audio sensor. For instance, the presenter module 122 prioritizes the further audio input over audio input received at the first audio sensor, such as to audibly emphasize speech input received at the second audio sensor. As mentioned above, for example, audio beamforming can be utilized to focus on audio received from a user that provides the further speech audio. For instance, audio beamforming can be utilized to filter out audible sounds (e.g., noise) that do not originate from the user that provides the further speech audio.
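One assumed way to prioritize one audio input over the other is to compare short-term speech energy at the two audio sensors and move focus only when there is a clear winner. The RMS measure and switch margin below are illustrative assumptions, not details from the method.

    import kotlin.math.sqrt

    enum class Mic { FRONT, REAR }

    // Root-mean-square energy of one audio frame.
    fun rms(frame: FloatArray): Double =
        if (frame.isEmpty()) 0.0
        else sqrt(frame.fold(0.0) { acc, s -> acc + s * s } / frame.size)

    fun selectActiveMic(
        frontFrame: FloatArray,
        rearFrame: FloatArray,
        current: Mic,
        switchMargin: Double = 2.0, // require a clear winner before moving focus
    ): Mic {
        val front = rms(frontFrame)
        val rear = rms(rearFrame)
        return when {
            rear > front * switchMargin -> Mic.REAR
            front > rear * switchMargin -> Mic.FRONT
            else -> current // ambiguous: keep the current focus
        }
    }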

Further, video data of the second user that provides the further speech audio is captured and utilized as video content for the call session. In at least one implementation the second user that provides the further speech audio is visually identified in the video content as an active user, e.g., a user that currently has focus in the call session.

At 710 the call session is output for the first user and the second user. For instance, video content and audio content of the call session are output (e.g., simultaneously) via the display devices 114 a, 114 c and the audio devices 116 a, 116 b, such as to enable the first user and the second user to simultaneously view and interact with the call session. In at least one implementation, if the first user and/or the second user are detected as moving out of view of a particular display device 114 and/or camera 124, output of the call session via a respective display device can be paused and/or stopped.

The method 700 is performable in real time to respond dynamically to changes in detected speech input over time. For instance, as users participating in the call session via the client device 102 alternate between providing speech input detected by the different audio sensors, audio focus is applied to the speech input from the respective users for generating audio content for the call session.

FIG. 8 illustrates a flow chart depicting an example method 800 for enabling a user to be detected via a camera in accordance with one or more implementations. The method 800, for instance, is implemented in conjunction with the method 500 and/or the method 600 to enable user repositioning relative to a client device to be detected.

At 802 it is detected that a user moves out of the field of view of a first camera of a client device. The presenter module 122, for instance, receives user state data from the recognition module 120 indicating that a user moves out of the field of view of the camera 124 a. For example, the user was previously detected as being positioned to view the display device 114 a and then moves out of a field of view of the camera 124 a. The user can be detected as moving out of the field of view of the first camera while media content is being output on the client device and/or while a call session is in progress on the client device.

At 804 a second camera of the client device is notified to scan for the user within the field of view of the second camera. For example, the presenter module 122 notifies the camera 124 b to scan for an image of the user in a field of view of the camera 124 b. In an implementation where image data from the camera 124 b indicates that the user is detected such that the display device 114 c is viewable by the user, output of media content and/or a call session can be switched to the display device 114 c, such as described above. In at least one implementation this enables the camera 124 b to be maintained in a low power mode and/or a power off mode until the camera 124 b is notified to scan for an image of the user.
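The low-power handoff described here might be sketched as follows. The Camera interface is a hypothetical stand-in for the platform camera API, which the text does not specify; the point of the sketch is that the rear camera is woken only on demand and returned to low power when no user is found.

    // Hypothetical camera handle with explicit power control.
    interface Camera {
        fun wake()
        fun sleep()
        fun userVisible(): Boolean
    }

    class CameraHandoff(private val rear: Camera) {
        // Called when the front camera reports that the user left its field of view.
        fun onUserLostAtFront(onResult: (foundAtRear: Boolean) -> Unit) {
            rear.wake()                    // leave low-power mode only on demand
            val found = rear.userVisible() // scan the rear field of view for the user
            if (!found) rear.sleep()       // nothing detected: return to low power
            onResult(found)
        }
    }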

The example methods described above may be performed in various ways, such as for implementing different aspects of the systems and scenarios described herein. Generally, any services, components, modules, methods, and/or operations described herein can be implemented using software, firmware, hardware (e.g., fixed logic circuitry), manual processing, or any combination thereof. Some operations of the example methods may be described in the general context of executable instructions stored on computer-readable storage memory that is local and/or remote to a computer processing system, and implementations can include software applications, programs, functions, and the like. Alternatively or in addition, any of the functionality described herein can be performed, at least in part, by one or more hardware logic components, such as, and without limitation, Field-programmable Gate Arrays (FPGAs), Application-specific Integrated Circuits (ASICs), Application-specific Standard Products (ASSPs), System-on-a-chip systems (SoCs), Complex Programmable Logic Devices (CPLDs), and the like. The order in which the methods are described is not intended to be construed as a limitation, and any number or combination of the described method operations can be performed in any order to perform a method, or an alternate method.

FIG. 9 illustrates various components of an example device 900 in which aspects of managing media content and call sessions in multi-display systems can be implemented. The example device 900 can be implemented as any of the devices described with reference to the previous FIGS. 1-8, such as any type of client device, mobile phone, mobile device, wearable device, tablet, computing, communication, entertainment, gaming, media playback, and/or other type of electronic device. For example, the client device 102 as shown and described with reference to FIGS. 1-8 may be implemented as the example device 900.

The device 900 includes communication transceivers 902 that enable wired and/or wireless communication of device data 904 with other devices. The device data 904 can include any of device identifying data, device location data, wireless connectivity data, and wireless protocol data. Additionally, the device data 904 can include any type of audio, video, and/or image data. Example communication transceivers 902 include wireless personal area network (WPAN) radios compliant with various IEEE 802.15 (Bluetooth™) standards, wireless local area network (WLAN) radios compliant with any of the various IEEE 802.11 (Wi-Fi™) standards, wireless wide area network (WWAN) radios for cellular phone communication, wireless metropolitan area network (WMAN) radios compliant with various IEEE 802.16 (WiMAX™) standards, and wired local area network (LAN) Ethernet transceivers for network data communication.

The device 900 may also include one or more data input ports 906 via which any type of data, media content, and/or inputs can be received, such as user-selectable inputs to the device, messages, music, television content, recorded content, and any other type of audio, video, and/or image data received from any content and/or data source. The data input ports may include USB ports, coaxial cable ports, and other serial or parallel connectors (including internal connectors) for flash memory, DVDs, CDs, and the like. These data input ports may be used to couple the device to any type of components, peripherals, or accessories such as microphones and/or cameras.

The device 900 includes a processing system 908 of one or more processors (e.g., any of microprocessors, controllers, and the like) and/or a processor and memory system implemented as a system-on-chip (SoC) that processes computer-executable instructions. The processor system may be implemented at least partially in hardware, which can include components of an integrated circuit or on-chip system, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a complex programmable logic device (CPLD), and other implementations in silicon and/or other hardware. Alternatively or in addition, the device can be implemented with any one or combination of software, hardware, firmware, or fixed logic circuitry that is implemented in connection with processing and control circuits, which are generally identified at 910. The device 900 may further include any type of a system bus or other data and command transfer system that couples the various components within the device. A system bus can include any one or combination of different bus structures and architectures, as well as control and data lines.

The device 900 also includes computer-readable storage memory 912 (e.g., memory devices) that enable data storage, such as data storage devices that can be accessed by a computing device, and that provide persistent storage of data and executable instructions (e.g., software applications, programs, functions, and the like). Examples of the computer-readable storage memory 912 include volatile memory and non-volatile memory, fixed and removable media devices, and any suitable memory device or electronic data storage that maintains data for computing device access. The computer-readable storage memory can include various implementations of random access memory (RAM), read-only memory (ROM), flash memory, and other types of storage media in various memory device configurations. The device 900 may also include a mass storage media device.

The computer-readable storage memory 912 provides data storage mechanisms to store the device data 904, other types of information and/or data, and various device applications 914 (e.g., software applications). For example, an operating system 916 can be maintained as software instructions with a memory device and executed by the processing system 908. The device applications may also include a device manager, such as any form of a control application, software application, signal-processing and control module, code that is native to a particular device, a hardware abstraction layer for a particular device, and so on. Computer-readable storage memory 912 represents media and/or devices that enable persistent and/or non-transitory storage of information in contrast to mere signal transmission, carrier waves, or signals per se. Computer-readable storage memory 912 does not include signals per se or transitory signals.

In this example, the device 900 includes a recognition module 918 a and a presenter module 918 b that implement aspects of managing media content and call sessions in multi-display systems and may be implemented with hardware components and/or in software as one of the device applications 914. For example, the recognition module 918 a can be implemented as the recognition module 120 and the presenter module 918 b can be implemented as the presenter module 122, described in detail above. In implementations, the recognition module 918 a and/or the presenter module 918 b may include independent processing, memory, and logic components as a computing and/or electronic device integrated with the device 900. The device 900 also includes presentation data 920 for implementing aspects of managing media content and call sessions in multi-display systems and may include data from the recognition module 918 a and/or the presenter module 918 b.
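
To make this division of labor concrete, the following is a minimal Kotlin sketch of how a recognition module and a presenter module might cooperate. The type and function names (RecognitionModule, PresenterModule, UserPosition, route) are illustrative placeholders introduced here for explanation, not APIs defined by this disclosure.

    // Illustrative placeholder types; not APIs defined by the disclosure.
    enum class UserPosition { FIRST_DISPLAY, SECOND_DISPLAY, UNKNOWN }

    // Interprets raw sensor signals (e.g., per-camera face detections) into a user position.
    class RecognitionModule {
        fun classify(faceAtFirstCamera: Boolean, faceAtSecondCamera: Boolean): UserPosition =
            when {
                faceAtFirstCamera -> UserPosition.FIRST_DISPLAY
                faceAtSecondCamera -> UserPosition.SECOND_DISPLAY
                else -> UserPosition.UNKNOWN
            }
    }

    // Routes media output to whichever display the user is positioned to view.
    class PresenterModule(private val route: (displayIndex: Int) -> Unit) {
        fun present(position: UserPosition) = when (position) {
            UserPosition.FIRST_DISPLAY -> route(0)
            UserPosition.SECOND_DISPLAY -> route(1)
            UserPosition.UNKNOWN -> Unit // keep the current routing
        }
    }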

In this example, the example device 900 also includes a camera 922 and motion sensors 924, such as may be implemented in an inertial measurement unit (IMU). The motion sensors 924 can be implemented with various sensors, such as a gyroscope, an accelerometer, and/or other types of motion sensors to sense motion of the device. The various motion sensors 924 may also be implemented as components of an inertial measurement unit in the device.
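
As one concrete example, on an Android-based implementation the motion sensors could be read through the platform SensorManager API, as sketched below; the disclosure itself does not mandate any particular API, so this listener is an assumption about one possible realization.

    import android.content.Context
    import android.hardware.Sensor
    import android.hardware.SensorEvent
    import android.hardware.SensorEventListener
    import android.hardware.SensorManager

    class MotionSensorReader(context: Context) : SensorEventListener {
        private val sensorManager =
            context.getSystemService(Context.SENSOR_SERVICE) as SensorManager

        fun start() {
            // Register for accelerometer and gyroscope updates if the sensors exist.
            sensorManager.getDefaultSensor(Sensor.TYPE_ACCELEROMETER)?.let {
                sensorManager.registerListener(this, it, SensorManager.SENSOR_DELAY_NORMAL)
            }
            sensorManager.getDefaultSensor(Sensor.TYPE_GYROSCOPE)?.let {
                sensorManager.registerListener(this, it, SensorManager.SENSOR_DELAY_NORMAL)
            }
        }

        fun stop() = sensorManager.unregisterListener(this)

        override fun onSensorChanged(event: SensorEvent) {
            // event.values carries acceleration (m/s^2) or angular rate (rad/s) on x, y, z.
        }

        override fun onAccuracyChanged(sensor: Sensor, accuracy: Int) = Unit
    }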

The device 900 also includes a wireless module 926, which is representative of functionality to perform various wireless communication tasks. For instance, for the client device 102, the wireless module 926 can be leveraged to scan for and detect wireless networks, as well as negotiate wireless connectivity to wireless networks for the client device 102. The device 900 can also include one or more power sources 928, such as when the device is implemented as a mobile device. The power sources 928 may include a charging and/or power system, and can be implemented as a flexible strip battery, a rechargeable battery, a charged super-capacitor, and/or any other type of active or passive power source.
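
For illustration only, a scan-and-detect operation of this kind maps onto Android's WifiManager as sketched below. Note that startScan() is deprecated and throttled on recent Android releases, so this is a sketch of the wireless module's role, not a recommended implementation.

    import android.content.Context
    import android.net.wifi.WifiManager

    // Requests a Wi-Fi scan; results arrive via the SCAN_RESULTS_AVAILABLE_ACTION broadcast.
    fun requestWifiScan(context: Context): Boolean {
        val wifi = context.applicationContext
            .getSystemService(Context.WIFI_SERVICE) as WifiManager
        @Suppress("DEPRECATION")
        return wifi.startScan()
    }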

The device 900 also includes an audio and/or video processing system 930 that generates audio data for an audio system 932 and/or generates display data for a display system 934. The audio system and/or the display system may include any devices that process, display, and/or otherwise render audio, video, display, and/or image data. Display data and audio signals can be communicated to an audio component and/or to a display component via an RF (radio frequency) link, S-video link, HDMI (high-definition multimedia interface), composite video link, component video link, DVI (digital video interface), analog audio connection, or other similar communication link, such as media data port 936. In implementations, the audio system and/or the display system are integrated components of the example device. Alternatively, the audio system and/or the display system are external, peripheral components to the example device.
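
On Android, for example, secondary display targets of the kind served by display system 934 can be enumerated through the platform DisplayManager; the snippet below is an illustrative sketch of how such targets might be discovered, not part of the disclosed design.

    import android.content.Context
    import android.hardware.display.DisplayManager
    import android.view.Display

    // Returns displays suitable for presenting content, e.g., a second built-in
    // panel or an externally connected display.
    fun presentationDisplays(context: Context): Array<Display> {
        val dm = context.getSystemService(Context.DISPLAY_SERVICE) as DisplayManager
        return dm.getDisplays(DisplayManager.DISPLAY_CATEGORY_PRESENTATION)
    }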

Although implementations of managing media content and call sessions in multi-display systems have been described in language specific to features and/or methods, the subject of the appended claims is not necessarily limited to the specific features or methods described. Rather, the features and methods are disclosed as example implementations, and other equivalent features and methods are intended to be within the scope of the appended claims. Further, various different examples are described and it is to be appreciated that each described example can be implemented independently or in connection with one or more other described examples. Additional aspects of the techniques, features, and/or methods discussed herein relate to one or more of the following:

In addition to the previously described methods, any one or more of the following:

In some aspects, the techniques described herein relate to a client device including: a first housing foldably attached to a second housing via a hinge; a first display device positioned at a first surface of the first housing and a second display device positioned at a second surface of the first housing; a sensor system including one or more sensors; and one or more modules implemented at least in part in hardware of the client device to: cause media content to be output via the first display device based on detecting that a user is positioned to view the first display device; receive sensor data from the sensor system indicating that a position of the user changes such that the user is positioned to view the second display device; and cause, based on the change in the position of the user, the media content to be output on the second display device.
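
A minimal sketch of this hand-off logic follows, assuming a hypothetical MediaOutput abstraction over the two display devices; neither name comes from the disclosure itself. Because the controller tracks the active display, the same path also handles stopping output on the prior display and any later switch back.

    // MediaOutput is a hypothetical abstraction, not an API from this disclosure.
    interface MediaOutput {
        fun startOn(displayIndex: Int)
        fun stopOn(displayIndex: Int)
    }

    class DisplayHandoffController(private val output: MediaOutput) {
        private var activeDisplay: Int? = null

        // Invoked whenever the sensor system reports which display the user
        // is positioned to view.
        fun onUserFacing(displayIndex: Int) {
            if (displayIndex == activeDisplay) return
            output.startOn(displayIndex)             // start on the newly viewed display
            activeDisplay?.let { output.stopOn(it) } // then stop on the previous one
            activeDisplay = displayIndex
        }
    }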

In some aspects, the techniques described herein relate to a client device, wherein the sensor data further includes an indication that the client device is in an open position based on a position of the first housing relative to the second housing.

In some aspects, the techniques described herein relate to a client device, wherein the indication that the client device is in an open position is based at least in part on detecting a position of the hinge relative to one or more of the first housing or the second housing.
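
On Android, one plausible source for such an open-position indication is Jetpack WindowManager's FoldingFeature, sketched below. The disclosure does not name this API, so treat the mapping as an assumption about one possible implementation.

    import androidx.activity.ComponentActivity
    import androidx.lifecycle.lifecycleScope
    import androidx.window.layout.FoldingFeature
    import androidx.window.layout.WindowInfoTracker
    import kotlinx.coroutines.launch

    fun observeOpenPosition(activity: ComponentActivity, onOpenChanged: (Boolean) -> Unit) {
        activity.lifecycleScope.launch {
            WindowInfoTracker.getOrCreate(activity)
                .windowLayoutInfo(activity)
                .collect { layoutInfo ->
                    val fold = layoutInfo.displayFeatures
                        .filterIsInstance<FoldingFeature>()
                        .firstOrNull()
                    // FLAT reports a fully opened hinge; HALF_OPENED a partial angle.
                    onOpenChanged(fold?.state == FoldingFeature.State.FLAT)
                }
        }
    }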

In some aspects, the techniques described herein relate to a client device, wherein the one or more modules are further implemented to output the media content via a first audio device based on detecting that the user is positioned to view the first display device, and to switch output of the media content to a second audio device based on the change in the position of the user.

In some aspects, the techniques described herein relate to a client device, wherein the one or more modules are further implemented to stop output of the media content via the first display device based on the change in the position of the user.

In some aspects, the techniques described herein relate to a client device, wherein the sensor data includes image data indicating that the user is detected within a field of view of a camera positioned at the second surface of the first housing of the client device.

In some aspects, the techniques described herein relate to a client device, wherein the sensor data includes audio data indicating that speech content from the user is detected at the second surface of the first housing of the client device.
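
One crude way to realize such an audio indication, assuming one microphone near each surface, is to compare short-term RMS levels of the two microphone signals; the comparison below is purely illustrative and omits any real voice-activity detection.

    import kotlin.math.sqrt

    // Returns 0 if the speaker is likely nearer the first surface, 1 otherwise.
    fun likelySpeakerSurface(firstMic: ShortArray, secondMic: ShortArray): Int {
        fun rms(samples: ShortArray): Double {
            if (samples.isEmpty()) return 0.0
            var acc = 0.0
            for (s in samples) acc += s.toDouble() * s
            return sqrt(acc / samples.size)
        }
        return if (rms(firstMic) >= rms(secondMic)) 0 else 1
    }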

In some aspects, the techniques described herein relate to a client device, wherein the detecting that the user is positioned to view the first display device is based at least in part on detecting the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the one or more modules are further implemented to: detect that the user moves out of the field of view of the first camera; and notify a second camera positioned at the second surface of the first housing to scan for an image of the user.
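
The notify-the-other-camera behavior might look like the following sketch, where UserScanner is a hypothetical wrapper around a per-camera face-detection pipeline rather than anything the disclosure defines.

    // UserScanner is a hypothetical per-camera detection wrapper.
    interface UserScanner {
        fun userInFieldOfView(): Boolean
        fun startScanning()   // begin scanning for an image of the user
        fun stopScanning()
    }

    class CameraHandoff(private val first: UserScanner, private val second: UserScanner) {
        // Called after each processed frame from the first camera.
        fun onFrameProcessed() {
            if (!first.userInFieldOfView()) {
                // The user left the first camera's field of view; notify the second
                // camera, positioned at the opposite surface, to scan for the user.
                second.startScanning()
            }
        }
    }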

In some aspects, the techniques described herein relate to a system including: one or more processors; and one or more computer-readable storage media storing instructions that are executable by the one or more processors to: detect that a first housing of a client device is positioned in an open position relative to a second housing of the client device; cause media content to be output via a first display device positioned at a first surface of the first housing based on detecting that a user is positioned to view the first display device; receive sensor data from a sensor system indicating that a position of the user changes such that the user is positioned to view a second display device positioned at a second surface of the first housing of the client device; and cause, based on the change in the position of the user, the media content to be output on the second display device.

In some aspects, the techniques described herein relate to a system, wherein to detect that the first housing of the client device is positioned in an open position relative to the second housing of the client device is based on detecting an orientation of a hinge that connects the first housing to the second housing.

In some aspects, the techniques described herein relate to a system, wherein to cause the media content to be output via the first display device is based on detecting an image of the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the sensor data from the sensor system indicating that a position of a user changes such that the user is positioned to view a second display includes an indication that an image of the user is detected within a field of view of a second camera positioned at the second surface of the first housing of the client device.

In some aspects, the techniques described herein relate to a system, wherein to cause the media content to be output via the first display device is based on detecting speech content from the user adjacent the first surface of the first housing of the client device, and wherein the sensor data from the sensor system indicating that a position of a user changes such that the user is positioned to view a second display includes detecting speech content from the user adjacent the second surface of the first housing of the client device.

In some aspects, the techniques described herein relate to a system, wherein detecting that the user is positioned to view the first display device is based at least in part on detecting the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the instructions are further executable by the one or more processors to: detect that the user moves out of the field of view of the first camera; and notify a second camera positioned at the second surface of the first housing to scan for an image of the user.

In some aspects, the techniques described herein relate to a system, wherein the instructions are further executable by the one or more processors to stop output of the media content on the first display device in response to output of the media content on the second display device.

In some aspects, the techniques described herein relate to a system, wherein the instructions are further executable by the one or more processors to output the media content via a first audio device based on detecting that the user is positioned to view the first display device, and to switch output of the media content to a second audio device based on the change in the position of the user.

In some aspects, the techniques described herein relate to a system, wherein the instructions are further executable by the one or more processors to: receive further sensor data from the sensor system indicating that the position of a user further changes such that the user is positioned to view the first display device; cause, based on the further change in the position of the user, the media content to be output on the first display device; and cause output of the media content via the second display device to stop.

In some aspects, the techniques described herein relate to a method, including: monitoring a position of a user relative to a first display device of a client device; causing media content to be output via the first display device based on detecting that the user is positioned to view the first display device; receiving sensor data from a sensor system indicating that a position of the user changes such that the user is positioned to view a second display device of the client device; and causing, based on the change in the position of the user, the media content to be output on the second display device.

In some aspects, the techniques described herein relate to a method, wherein the client device includes a first housing attached to a second housing via a hinge, the first display device is positioned at a first surface of the first housing, the second display device is positioned at a second surface of the first housing, and wherein the monitoring the position of the user relative to the first display device is based at least in part on: determining that the client device is in an open position based on a position of the first housing relative to the second housing; and determining that the media content is to be output via the client device.

In some aspects, the techniques described herein relate to a method, wherein detecting that the user is positioned to view the first display device is based on detecting the user within a field of view of a first camera of the client device, and wherein the sensor data indicating that the position of the user changes such that the user is positioned to view a second display device of the client device includes detecting the user within a field of view of a second camera of the client device.

In some aspects, the techniques described herein relate to a method, further including: detecting that the user moves out of the field of view of the first camera; and notifying the second camera to scan for the user within the field of view of the second camera.

1. A client device comprising: a first housing foldably attached to a second housing via a hinge; a first display device positioned at a first surface of the first housing and a second display device positioned at a second surface of the first housing; a sensor system including one or more sensors; and one or more modules implemented at least in part in hardware of the client device to: cause media content to be output via the first display device based on detecting that a user is positioned to view the first display device; receive sensor data from the sensor system indicating that a position of the user changes such that the user is positioned to view the second display device; and cause, based on the change in the position of the user, the media content to be output on the second display device.
2. The client device as recited in claim 1, wherein the sensor data further comprises an indication that the client device is in an open position based on a position of the first housing relative to the second housing.
3. The client device as recited in claim 2, wherein the indication that the client device is in an open position is based at least in part on detecting a position of the hinge relative to one or more of the first housing or the second housing.
4. The client device as recited in claim 1, wherein the one or more modules are further implemented to output the media content via a first audio device based on detecting that the user is positioned to view the first display device, and to switch output of the media content to a second audio device based on the change in the position of the user.
5. The client device as recited in claim 1, wherein the one or more modules are further implemented to stop output of the media content via the first display device based on the change in the position of the user.
6. The client device as recited in claim 1, wherein the sensor data comprises image data indicating that the user is detected within a field of view of a camera positioned at the second surface of the first housing of the client device.
7. The client device as recited in claim 1, wherein the sensor data comprises audio data indicating that speech content from the user is detected at the second surface of the first housing of the client device.
8. The client device as recited in claim 1, wherein the detecting that the user is positioned to view the first display device is based at least in part on detecting the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the one or more modules are further implemented to: detect that the user moves out of the field of view of the first camera; and notify a second camera positioned at the second surface of the first housing to scan for an image of the user.
9. A system comprising: one or more processors; and one or more computer-readable storage media storing instructions that are executable by the one or more processors to: detect that a first housing of a client device is positioned in an open position relative to a second housing of the client device; cause media content to be output via a first display device positioned at a first surface of the first housing based on detecting that a user is positioned to view the first display device; receive sensor data from a sensor system indicating that a position of the user changes such that the user is positioned to view a second display device positioned at a second surface of the first housing of the client device; and cause, based on the change in the position of the user, the media content to be output on the second display device.
10. The system of claim 9, wherein to detect that the first housing of the client device is positioned in an open position relative to the second housing of the client device is based on detecting an orientation of a hinge that connects the first housing to the second housing.
11. The system of claim 9, wherein to cause the media content to be output via the first display device is based on detecting an image of the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the sensor data from the sensor system indicating that a position of a user changes such that the user is positioned to view a second display comprises an indication that an image of the user is detected within a field of view of a second camera positioned at the second surface of the first housing of the client device.
12. The system of claim 9, wherein to cause the media content to be output via the first display device is based on detecting speech content from the user adjacent the first surface of the first housing of the client device, and wherein the sensor data from the sensor system indicating that a position of a user changes such that the user is positioned to view a second display comprises detecting speech content from the user adjacent the second surface of the first housing of the client device.
13. The system of claim 9, wherein detecting that the user is positioned to view the first display device is based at least in part on detecting the user within a field of view of a first camera positioned at the first surface of the first housing of the client device, and wherein the instructions are further executable by the one or more processors to: detect that the user moves out of the field of view of the first camera; and notify a second camera positioned at the second surface of the first housing to scan for an image of the user.
14. The system of claim 9, wherein the instructions are further executable by the one or more processors to stop output of the media content on the first display device in response to output of the media content on the second display device.
15. The system of claim 9, wherein the instructions are further executable by the one or more processors to output the media content via a first audio device based on detecting that the user is positioned to view the first display device, and to switch output of the media content to a second audio device based on the change in the position of the user.
16. The system of claim 9, wherein the instructions are further executable by the one or more processors to: receive further sensor data from the sensor system indicating that the position of a user further changes such that the user is positioned to view the first display device; cause, based on the further change in the position of the user, the media content to be output on the first display device; and cause output of the media content via the second display device to stop.
17. A method, comprising: monitoring a position of a user relative to a first display device of a client device; causing media content to be output via the first display device based on detecting that the user is positioned to view the first display device; receiving sensor data from a sensor system indicating that a position of the user changes such that the user is positioned to view a second display device of the client device; and causing, based on the change in the position of the user, the media content to be output on the second display device.
18. The method as recited in claim 17, wherein the client device comprises a first housing attached to a second housing via a hinge, the first display device is positioned at a first surface of the first housing, the second display device is positioned at a second surface of the first housing, and wherein said monitoring the position of the user relative to the first display device is based at least in part on: determining that the client device is in an open position based on a position of the first housing relative to the second housing; and determining that the media content is to be output via the client device.
19. The method as recited in claim 17, wherein detecting that the user is positioned to view the first display device is based on detecting the user within a field of view of a first camera of the client device, and wherein the sensor data indicating that the position of the user changes such that the user is positioned to view a second display device of the client device comprises detecting the user within a field of view of a second camera of the client device.
20. The method as recited in claim 19, further comprising: detecting that the user moves out of the field of view of the first camera; and notifying the second camera to scan for the user within the field of view of the second camera.